Goto

Collaborating Authors

 practical debugging tool


Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks

Neural Information Processing Systems

When engineers train deep learning models, they are very much flying blind. Commonly used methods for real-time training diagnostics, such as monitoring the train/test loss, are limited. Assessing a network's training process solely through these performance indicators is akin to debugging software without access to internal states through a debugger. To address this, we present Cockpit, a collection of instruments that enable a closer look into the inner workings of a learning machine, and a more informative and meaningful status report for practitioners.


Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks

Neural Information Processing Systems

When engineers train deep learning models, they are very much "flying blind". Commonly used methods for real-time training diagnostics, such as monitoring the train/test loss, are limited. Assessing a network's training process solely through these performance indicators is akin to debugging software without access to internal states through a debugger. To address this, we present Cockpit, a collection of instruments that enable a closer look into the inner workings of a learning machine, and a more informative and meaningful status report for practitioners. These instruments leverage novel higher-order information about the gradient distribution and curvature, which has only recently become efficiently accessible.